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SEQUENCE LISTING 

) GENERAL INFORMATION: 

(i) APPLICANT: BURRELL , MARILEE 
HILL, DAVID E. 
KINZLER, KENNETH W. 
VOGELSTEIN, BERT 

(ii) TITLE OF INVENTION: AMPLIFICATION OF HUMAN MDM2 GENE IN 
HUMAN TUMORS 

(iii) NUMBER OF SEQUENCES: 5 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: BANNER, BIRCH, MCKIE AND BECKETT 

(B) STREET: 1001 G STREET, N.W. 
<C) CITY: WASHINGTON 

(D) STATE: D-C. 

(E) COUNTRY: USA 

(F) ZIP: 20001 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

<C) OPERATING SYSTEM: PC-DOS /MS -DOS 

(D) SOFTWARE: Patentln Release #1,0, Version #1*25 
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<vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 07-APR-1993 

(C) CLASSIFICATION: 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: RAG AN, SARAH A. 

(B) REGISTRATION NUMBER: 32,141 

(C) REFERENCE /DOCKET NUMBER: 01107.42798 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 202-508-9100 

(B) TELEFAX: 202-508-9299 

(C) TELEX: 197430 BBMB UT 

(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(v) FRAGMENT TYPE: N-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

<viii) POSITION IN GENOME: 

(A) CHROMOSOME /SEGMENT : 17q 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

Met Glu Glu Pro Gin Ser Asp Pro Ser Val Glu Pro Pro Leu Ser Gin 
15 10 15 

Glu Thr Phe Ser Asp Leu Trp Lys Leu Leu Pro Glu Asn Asn Val Leu 
20 25 30 

Ser Pro Leu Pro Ser Gin Ala Met Asp Asp Leu Met Leu Ser Pro Asp 
35 40 45 

Asp lie Glu Gin Trp Phe Thr Glu Asp Pro Gly Pro Asp Glu Ala Pro 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2372 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(H) CELL LINE: CaCo-2 

(viii) POSITION IN GENOME: 

<B) MAP POSITION: 12ql2-14 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

<B) LOCATION: 312.. 1784 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

GCACCGCGCG AGCTTGGCTG CTTCTGGGGC CTGTGTGGCC CTGTGTGTCG GAAAGATGGA 60 

GCAAGAAGCC GAGCCCGAGG GGCGGCCGCG ACCCCTCTGA CCGAGATCCT GCTGCTTTCG 120 

CAGCCAGGAG CACCGTCCCT CCCCGGATTA GTGCGTACGA GCGCCCAGTG CCCTGGCCCG 180 
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GAGAGTGGAA TGATCCCCGA GGCCCAGGGC GTCGTGCTTC CGCAGTAGTC AGTCCCCGTG 240 

AAGGAAACTG GGGAGTCTTG AGGGACCCCC GACTCCAAGC GCGAAAACCC CGGATGGTGA 300 

GGAGCAGGCA A ATG TGC AAT ACC AAC ATG TCT GTA CCT ACT GAT GGT GCT 350 
Met Cys Asn Thr Asn Met Ser Val Pro Thr Asp Gly Ala 

15 10 

GTA ACC ACC TCA CAG ATT CCA GCT TCG GAA CAA GAG ACC CTG GTT AGA 398 
Val Thr Thr Ser Gin lie Pro Ala Ser Glu Gin Glu Thr Leu Val Arg 
15 20 25 

CCA AAG CCA TTG CTT TTG AAG TTA TTA AAG TCT GTT GGT GCA CAA AAA 446 
Pro Lys Pro Leu Leu Leu Lys Leu Leu Lys Ser Val Gly Ala Gin Lys 
30 35 40 45 

GAC ACT TAT ACT ATG AAA GAG GTT CTT TTT TAT CTT GGC CAG TAT ATT 494 
Asp Thr Tyr Thr Met Lys Glu Val Leu Phe Tyr Leu Gly Gin Tyr lie 
50 55 60 

ATG ACT AAA CGA TTA TAT GAT GAG AAG CAA CAA CAT ATT GTA TAT TGT 542 
Met Thr Lys Arg Leu Tyr Asp Glu Lys Gin Gin His lie Val Tyr Cys 
65 70 75 



TCA AAT 



Ser Asn 



GAT CTT CTA GGA GAT TTG TTT GGC GTG CCA AGC TTC TCT GTG 
Asp Leu Leu Gly Asp Leu Phe Gly Val Pro Ser Phe Ser Val 
80 85 90 



590 
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AAA GAG CAC AGG AAA ATA TAT ACC ATG ATC TAC AGG AAC TTG GTA GTA 638 
Lys Glu His Arg Lys lie Tyr Thr Met lie Tyr Arg Asn Leu Val Val 
95 100 105 

GTC AAT CAG CAG GAA TCA TCG GAC TCA GGT ACA TCT GTG AGT GAG AAC 686 
Val Asn Gin Gin Glu Ser Ser Asp Ser Gly Thr Ser Val Ser Glu Asn 
110 115 120 125 

AGG TGT CAC CTT GAA GGT GGG AGT GAT CAA AAG GAC CTT GTA CAA GAG 734 
Arg Cys His Leu Glu Gly Gly Ser Asp Gin Lys Asp Leu Val Gin Glu 
130 135 140 

CTT CAG GAA GAG AAA CCT TCA TCT TCA CAT TTG GTT TCT AG A CCA TCT 782 
Leu Gin Glu Glu Lys Pro Ser Ser Ser His Leu Val Ser Arg Pro Ser 
145 150 155 

ACC TCA TCT AGA AGG AGA GCA ATT AGT GAG ACA GAA GAA AAT TCA GAT 830 

Thr Ser Ser Arg Arg Arg Ala lie Ser Glu Thr Glu Glu Asn Ser Asp 
160 165 170 

GAA TTA TCT GGT GAA CGA CAA AGA AAA CGC CAC AAA TCT GAT AGT ATT 878 
Glu Leu Ser Gly Glu Arg Gin Arg Lys Arg His Lys Ser Asp Ser lie 
175 180 185 

TCC CTT TCC TTT GAT GAA AGC CTG GCT CTG TGT GTA ATA AGG GAG ATA 92 6 

Ser Leu Ser Phe Asp Glu Ser Leu Ala Leu Cys Val lie Arg Glu lie 
190 195 200 205 
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TGT TGT GAA AGA AGC AGT AGC AGT GAA TCT ACA GGG ACG CCA TCG AAT 974 
Cys Cys Glu Arg Ser Ser Ser Ser Glu Ser Thr Gly Thr Pro Ser Asn 
210 215 220 

CCG GAT CTT GAT GCT GGT GTA AGT GAA CAT TCA GGT GAT TGG TTG GAT 1022 
Pro Asp Leu Asp Ala Gly Val Ser Glu His Ser Gly Asp Trp Leu Asp 
225 230 235 

CAG GAT TCA GTT TCA GAT CAG TTT AGT GTA GAA TTT GAA GTT GAA TCT 1070 
Gin Asp Ser Val Ser Asp Gin Phe Ser Val Glu Phe Glu Val Glu Ser 
240 245 250 

CTC GAC TCA GAA GAT TAT AGC CTT AGT GAA GAA GGA CAA GAA CTC TCA 1118 
Leu Asp Ser Glu Asp Tyr Ser Leu Ser Glu Glu Gly Gin Glu Leu Ser 
255 260 265 

GAT GAA GAT GAT GAG GTA TAT CAA GTT ACT GTG TAT CAG GCA GGG GAG 1166 
Asp Glu Asp Asp Glu Val Tyr Gin Val Thr Val Tyr Gin Ala Gly Glu 
270 275 280 285 

AGT GAT ACA GAT TCA TTT GAA GAA GAT CCT GAA ATT TCC TTA GCT GAC 1214 
Ser Asp Thr Asp Ser Phe Glu Glu Asp Pro Glu lie Ser Leu Ala Asp . 

290 295 300 



TAT 
Tyr 



TGG AAA 
Trp Lys 



TGC ACT TCA TGC AAT GAA ATG AAT CCC CCC CTT CCA TCA 
Cys Thr Ser Cys Asn Glu Met Asn Pro Pro Leu Pro Ser 
305 310 315 



1262 
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CAT TGC AAC AGA TGT TGG GCC CTT CGT GAG AAT TGG CTT CCT GAA GAT 1310 
His Cys Asn Arg Cys Trp Ala Leu Arg Glu Asn Trp Leu Pro Glu Asp 
320 325 330 

AAA GGG AAA GAT AAA GGG GAA ATC TCT GAG AAA GCC AAA CTG GAA AAC 1358 
Lys Gly Lys Asp Lys Gly Glu He Ser Glu Lys Ala Lys Leu Glu Asn 
335 340 345 

TCA ACA CAA GCT GAA GAG GGC TTT GAT GTT CCT GAT TGT AAA AAA ACT 1406 
Ser Thr Gin Ala Glu Glu Gly Phe Asp Val Pro Asp Cys Lys Lys Thr 
350 355 360 365 

ATA GTG AAT GAT TCC AGA GAG TCA TGT GTT GAG GAA AAT GAT GAT AAA 1454 
He Val Asn Asp Ser Arg Glu Ser Cys Val Glu Glu Asn Asp Asp Lys 
370 375 380 

ATT ACA CAA GCT TCA CAA TCA CAA GAA AGT GAA GAC TAT TCT CAG CCA 1502 
He Thr Gin Ala Ser Gin Ser Gin Glu Ser Glu Asp Tyr Ser Gin Pro 
385 390 395 

TCA ACT TCT AGT AGC ATT ATT TAT AGC AGC CAA GAA GAT GTG AAA GAG 1550 
Ser Thr Ser Ser Ser He He Tyr Ser Ser Gin Glu Asp Val Lys Glu 
400 405 410 



TTT GAA AGG GAA GAA ACC CAA GAC AAA GAA GAG AGT GTG GAA TCT AGT 
Phe Glu Arg Glu Glu Thr Gin Asp Lys Glu Glu Ser Val Glu Ser Ser 
415 420 425 



1598 
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TTG CCC CTT AAT GCC ATT GAA CCT TGT GTG ATT TGT CAA GGT CGA CCT 1646 
Leu Pro Leu Asn Ala lie Glu Pro Cys Val lie Cys Gin Gly Arg Pro 
430 435 440 445 

AAA AAT GGT TGC ATT GTC CAT GGC AAA ACA GGA CAT CTT ATG GCC TGC 1694 
Lys Asn Gly Cys lie Val His Gly Lys Thr Gly His Leu Met Ala Cys 
450 455 460 

TTT ACA TGT GCA AAG AAG CTA AAG AAA AGG AAT AAG CCC TGC CCA GTA 1742 
Phe Thr Cys Ala Lys Lys Leu Lys Lys Arg Asn Lys Pro Cys Pro Val 
465 470 475 

TGT AGA CAA CCA ATT CAA ATG ATT GTG CTA ACT TAT TTC CCC 1784 
Cys Arg Gin Pro lie Gin Met He Val Leu Thr Tyr Phe Pro 
480 485 490 

TAGTTGACCT GTCTATAAGA GAATTATATA TTTCTAACTA TATAACCCTA GGAATTTAGA 1844 

CAACCTGAAA TTTATTCACA TATATCAAAG TGAGAAAATG CCTCAATTCA CATAGATTTC 1904 

TTCTCTTTAG TATAATTGAC CTACTTTGGT AGTGGAATAG TGAATACTTA CTATAATTTG 1964 

ACTTGAATAT GTAGCTCATC CTTTACACCA ACTCCTAATT TTAAATAATT TCTACTCTGT 2024 

CTTAAATGAG AAGTACTTGG TTTTTTTTTT CTTAAATATG TATATGACAT TTAAATGTAA 2084 

CTTATTATTT TTTTTGAGAC CGAGTCTTGC TCTGTTACCC AGGCTGGAGT GCAGTGGGTG 2144 

ATCTTGGCTC ACTGCAAGCT CTGCCCTCCC CGGGTTCGCA CCATTCTCCT GCCTCAGCCT 2204 
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CCCAATTAGC TTGGCCTACA GTCATCTGCC ACCACACCTG GCTAATTTTT TGTACTTTTA 2264 
GTAGAGACAG GGTTTCACCG TGTTAGCCAG GATGGTCTCG ATCTCCTGAC CTCGTGATCC 2324 
GCCCACCTCG GCCTCCCAAA GTGCTGGGAT TACAGGCATG AGCCACCG 2372 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 491 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 3: 

Met Cys Asn Thr Asn Met Ser Val Pro Thr Asp Gly Ala Val Thr Thr 
15 10 15 

Ser Gin lie Pro Ala Ser Glu Gin Glu Thr Leu Val Arg Pro Lys Pro 
20 25 30 

Leu Leu Leu Lys Leu Leu Lys Ser Val Gly Ala Gin Lys Asp Thr Tyr 
35 40 45 



Thr Met Lys Glu Val Leu Phe Tyr Leu Gly Gin Tyr lie Met Thr Lys 
50 55 60 
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Arg Leu Tyr Asp Glu Lys Gin Gin His lie Val Tyr Cys Ser Asn Asp 
65 70 75 80 

Leu Leu Gly Asp Leu Phe Gly Val Pro Ser Phe Ser Val Lys Glu His 
85 90 95 

Arg Lys He Tyr Thr Met He Tyr Arg Asn Leu Val Val Val Asn Gin 
100 105 HO 

Gin Glu Ser Ser Asp Ser Gly Thr Ser Val Ser Glu Asn Arg Cys His 
115 120 125 

Leu Glu Gly Gly Ser Asp Gin Lys Asp Leu Val Gin Glu Leu Gin Glu 
130 135 140 

Glu Lys Pro Ser Ser Ser His Leu Val Ser Arg Pro Ser Thr Ser Ser 
145 150 155 160 

Arg Arg Arg Ala He Ser Glu Thr Glu Glu Asn Ser Asp Glu Leu Ser 
165 170 175 

Gly Glu Arg Gin Arg Lys Arg His Lys Ser Asp Ser He Ser Leu Ser 
180 185 190 

Phe Asp Glu Ser Leu Ala Leu Cys Val He Arg Glu He Cys Cys Glu 
195 200 205 

Arg Ser Ser Ser Ser Glu Ser Thr Gly Thr Pro Ser Asn Pro Asp Leu 
210 215 220 
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Asp Ala Gly Val Ser Glu His Ser Gly Asp Trp Leu Asp Gin Asp Ser 
225 230 235 240 

Val Ser Asp Gin Phe Ser Val Glu Phe Glu Val Glu Ser Leu Asp Ser 
245 250 255 

Glu Asp Tyr Ser Leu Ser Glu Glu Gly Gin Glu Leu Ser Asp Glu Asp 
260 265 270 

Asp Glu Val Tyr Gin Val Thr Val Tyr Gin Ala Gly Glu Ser Asp Thr 
275 280 285 

Asp Ser Phe Glu Glu Asp Pro Glu lie Ser Leu Ala Asp Tyr Trp Lys 
290 295 300 

Cys Thr Ser Cys Asn Glu Met Asn Pro Pro Leu Pro Ser His Cys Asn 
305 310 315 320 

Arg Cys Trp Ala Leu Arg Glu Asn Trp Leu Pro Glu Asp Lys Gly Lys 
325 330 335 

Asp Lys Gly Glu lie Ser Glu Lys Ala Lys Leu Glu Asn Ser Thr Gin 
340 345 350 

Ala Glu Glu Gly Phe Asp Val Pro Asp Cys Lys Lys Thr He Val Asn 
355 360 365 



Asp Ser Arg Glu Ser Cys Val Glu Glu Asn Asp Asp Lys He Thr Gin 
370 375 380 



- 45 - 



Ala Ser Gin Ser Gin Glu Ser Glu Asp Tyr Ser Gin Pro Ser Thr Ser 
385 390 395 400 

Ser Ser lie lie Tyr Ser Ser Gin Glu Asp Val Lys Glu Phe Glu Arg 
405 410 415 

Glu Glu Thr Gin Asp Lys Glu Glu Ser Val Glu Ser Ser Leu Pro Leu 
420 425 430 

Asn Ala He Glu Pro Cys Val He Cys Gin Gly Arg Pro Lys Asn Gly 
435 440 445 

Cys He Val His Gly Lys Thr Gly His Leu Met Ala Cys Phe Thr Cys 
450 455 460 

Ala Lys Lys Leu Lys Lys Arg Asn Lys Pro Cys Pro Val Cys Arg Gin 
465 470 475 480 

Pro He Gin Met He Val Leu Thr Tyr Phe Pro 
485 490 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1710 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



(iii) HYPOTHETICAL: NO 



<iv) ANT I -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 202.. 1668 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

GAGGAGCCGC CGCCTTCTCG TCGCTCGAGC TCTGGACGAC CATGGTCGCT CAGGCCCCGT 60 

CCGCGGGGCC TCCGCGCTCC CCGTGAAGGG TCGGAAGATG CGCGGGAAGT AGCAGCCGTC 120 

TGCTGGGCGA GCGGGAGACC GACCGGACAC CCCTGGGGGA CCCTCTCGGA TCACCGCGCT 180 

TCTCCTGCGG CCTCCAGGCC A ATG TGC AAT ACC AAC ATG TCT GTG TCT ACC 231 

Met Cys Asn Thr Asn Met Ser Val Ser Thr 
15 10 



GAG GGT GCT GCA AGC ACC TCA CAG ATT CCA GCT TCG GAA CAA GAG ACT 
Glu Gly Ala Ala Ser Thr Ser Gin lie Pro Ala Ser Glu Gin Glu Thr 
15 20 25 



279 
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CTG GTT AG A CCA AAA CCA TTG CTT TTG AAG TTG TTA AAG TCC GTT GGA 327 
Leu Val Arg Pro Lys Pro Leu Leu Leu Lys Leu Leu Lys Ser Val Gly 
30 35 40 

GCG CAA AAC GAC ACT TAC ACT ATG AAA GAG ATT ATA TTT TAT ATT GGC 375 
Ala Gin Asn Asp Thr Tyr Thr Met Lys Glu He He Phe Tyr He Gly 
45 50 55 

CAG TAT ATT ATG ACT AAG AGG TTA TAT GAC GAG AAG CAG CAG CAC ATT 423 
Gin Tyr He Met Thr Lys Arg Leu Tyr Asp Glu Lys Gin Gin His He 
60 65 70 

GTG TAT TGT TCA AAT GAT CTC CTA GGA GAT GTG TTT GGA GTC CCG AGT 471 
Val Tyr Cys Ser Asn Asp Leu Leu Gly Asp Val Phe Gly Val Pro Ser 
75 80 85 90 

TTC TCT GTG AAG GAG CAC AGG AAA ATA TAT GCA ATG ATC TAC AGA AAT 519 
Phe Ser Val Lys Glu His Arg Lys He Tyr Ala Met He Tyr Arg Asn 
95 100 105 

TTA GTG GCT GTA AGT CAG CAA GAC TCT GGC ACA TCG CTG AGT GAG AGC 567 
Leu Val Ala Val Ser Gin Gin Asp Ser Gly Thr Ser Leu Ser Glu Ser 
110 115 120 



AGA 
Arg 



CGT CAG CCT GAA GGT GGG AGT GAT CTG AAG GAT CCT TTG CAA GCG 
Arg Gin Pro Glu Gly Gly Ser Asp Leu Lys Asp Pro Leu Gin Ala 
125 130 135 



615 



-48 - 



CCA CCA GAA GAG AAA CCT TCA TCT TCT GAT TTA ATT TCT AGA CTG TCT 
Pro Pro Glu Glu Lys Pro Ser Ser Ser Asp Leu lie Ser Arg Leu Ser 
140 145 150 

ACC TCA TCT AGA AGG AGA TCC ATT AGT GAG ACA GAA GAG AAC ACA GAT 
Thr Ser Ser Arg Arg Arg Ser lie Ser Glu Thr Glu Glu Asn Thr Asp 
155 160 165 170 

GAG CTA CCT GGG GAG CGG CAC CGG AAG CGC CGC AGG TCC CTG TCC TTT 
Glu Leu Pro Gly Glu Arg His Arg Lys Arg Arg Arg Ser Leu Ser Phe 
175 180 185 

GAT CCG AGC CTG GGT CTG TGT GAG CTG AGG GAG ATG TGC AGC GGC GGC 
Asp Pro Ser Leu Gly Leu Cys Glu Leu Arg Glu Met Cys Ser Gly Gly 
190 195 200 

ACG AGC AGC AGT AGC AGC AGC AGC AGC GAG TCC ACA GAG ACG CCC TCG 
Thr Ser Ser Ser Ser Ser Ser Ser Ser Glu Ser Thr Glu Thr Pro Ser 
205 210 215 

CAT CAG GAT CTT GAC GAT GGC GTA AGT GAG CAT TCT GGT GAT TGC CTG 
His Gin Asp Leu Asp Asp Gly Val Ser Glu His Ser Gly Asp Cys Leu 
220 225 230 

GAT CAG GAT TCA GTT TCT GAT CAG TTT AGC GTG GAA TTT GAA GTT GAG 
Asp Gin Asp Ser Val Ser Asp Gin Phe Ser Val Glu Phe Glu Val Glu 
235 240 245 250 
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TCT CTG GAC TCG GAA GAT TAC AGC CTG AGT GAC GAA GGG CAC GAG CTC 999 
Ser Leu Asp Ser Glu Asp Tyr Ser Leu Ser Asp Glu Gly His Glu Leu 
255 260 265 

TCA GAT GAG GAT GAT GAG GTC TAT CGG GTC ACA GTC TAT CAG ACA GGA 1047 
Ser Asp Glu Asp Asp Glu Val Tyr Arg Val Thr Val Tyr Gin Thr Gly 
270 275 280 

GAA AGC GAT ACA GAC TCT TTT GAA GGA GAT CCT GAG ATT TCC TTA GCT 1095 
Glu Ser Asp Thr Asp Ser Phe Glu Gly Asp Pro Glu lie Ser Leu Ala 
285 290 295 

GAC TAT TGG AAG TGT ACC TCA TGC AAT GAA ATG AAT CCT CCC CTT CCA 1143 
Asp Tyr Trp Lys Cys Thr Ser Cys Asn Glu Met Asn Pro Pro Leu Pro 
300 305 310 

TCA CAC TGC AAA AGA TGC TGG ACC CTT CGT GAG AAC TGG CTT CCA GAC 1191 
Ser His Cys Lys Arg Cys Trp Thr Leu Arg Glu Asn Trp Leu Pro Asp 
315 320 325 330 

GAT AAG GGG AAA GAT AAA GTG GAA ATC TCT GAA AAA GCC AAA CTG GAA 1239 
Asp Lys Gly Lys Asp Lys Val Glu lie Ser Glu Lys Ala Lys Leu Glu 
335 340 345 



AAC 
Asn 



TCA 
Ser 



GCT CAG GCA GAA GAA GGC TTG GAT GTG CCT GAT GGC AAA AAG 
Ala Gin Ala Glu Glu Gly Leu Asp Val Pro Asp Gly Lys Lys 
350 355 360 
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CTG ACA GAG AAT GAT GCT AAA GAG CCA TGT GCT GAG GAG GAC AGC GAG 1335 
Leu Thr Glu Asn Asp Ala Lys Glu Pro Cys Ala Glu Glu Asp Ser Glu 
365 370 375 

GAG AAG GCC GAA CAG ACG CCC CTG TCC CAG GAG AGT GAC GAC TAT TCC 1383 
Glu Lys Ala Glu Gin Thr Pro Leu Ser Gin Glu Ser Asp Asp Tyr Ser 
380 385 390 

CAA CCA TCG ACT TCC AGC AGC ATT GTT TAT AGC AGC CAA GAA AGC GTG 1431 
Gin Pro Ser Thr Ser Ser Ser lie Val Tyr Ser Ser Gin Glu Ser Val 
395 400 405 410 

AAA GAG TTG AAG GAG GAA ACG CAG CAC AAA GAC GAG AGT GTG GAA TCT 1479 
Lys Glu Leu Lys Glu Glu Thr Gin His Lys Asp Glu Ser Val Glu Ser 
415 420 425 

AGC TTC TCC CTG AAT GCC ATC GAA CCA TGT GTG ATC TGC CAG GGG CGG 1527 
Ser Phe Ser Leu Asn Ala lie Glu Pro Cys Val lie Cys Gin Gly Arg 
430 435 440 

CCT AAA AAT GGC TGC ATT GTT CAC GGC AAG ACT GGA CAC CTC ATG TCA 1575 
Pro Lys Asn Gly Cys lie Val His Gly Lys Thr Gly His Leu Met Ser 
445 450 455 

TGT TTC ACG TGT GCA AAG AAG CTA AAA AAA AGA AAC AAG CCC TGC CCA 1623 
Cys Phe Thr Cys Ala Lys Lys Leu Lys Lys Arg Asn Lys Pro Cys Pro 
460 465 470 
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GTG TGC AG A CAG CCA ATC CAA ATG ATT GTG CTA AGT TAC TTC AAC 1668 
Val Cys Arg Gin Pro lie Gin Met lie Val Leu Ser Tyr Phe Asn 
475 480 485 

TAGCTGACCT GCTCACAAAA ATAGAATTTT ATATTTCTAA CT 1710 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 489 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Cys Asn Thr Asn Met Ser Val Ser Thr Glu Gly Ala Ala Ser Thr 
15 10 15 

Ser Gin He Pro Ala Ser Glu Gin Glu Thr Leu Val Arg Pro Lys Pro 
20 25 30 

Leu Leu Leu Lys Leu Leu Lys Ser Val Gly Ala Gin Asn Asp Thr Tyr 
35 40 45 



Thr Met Lys Glu He He Phe Tyr lie Gly Gin Tyr He Met Thr Lys 
50 55 60 



Arg Leu Tyr 
65 

Leu Leu Gly 

Arg Lys lie 

Gin Asp Ser 
115 

Gly Ser Asp Leu 
130 

Ser Ser Ser Asp 
145 

Ser lie Ser Glu 

His Arg Lys Arg 
180 

Cys Glu Leu Arg 
195 



Asp Glu Lys Gin Gin 
70 

Asp Val Phe Gly Val 
85 

Tyr Ala Met lie Tyr 
100 

Gly Thr Ser Leu Ser 
120 

Lys Asp Pro Leu 
135 

Leu lie Ser Arg 
150 

Thr Glu Glu Asn 
165 

Arg Arg Ser Leu 

Glu Met Cys Ser 
200 
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His lie Val Tyr 
75 

Pro Ser Phe Ser 
90 

Arg Asn Leu Val 
105 

Glu Ser Arg Arg 

Gin Ala Pro Pro 
140 

Leu Ser Thr Ser 
155 

Thr Asp Glu Leu 
170 

Ser Phe Asp Pro 
185 

Gly Gly Thr Ser 



Cys Ser Asn Asp 
80 

Val Lys Glu His 
95 

Ala Val Ser Gin 
110 

Gin Pro Glu Gly 
125 

Glu Glu Lys Pro 

Ser Arg Arg Arg 
160 

Pro Gly Glu Arg 
175 

Ser Leu Gly Leu 
190 

Ser Ser Ser Ser 
205 



Ser Ser Ser Glu Ser Thr Glu Thr Pro Ser His Gin Asp Leu Asp Asp 
210 215 220 



-53- 



Gly Val Ser Glu His Ser Gly Asp Cys Leu Asp Gin Asp Ser Val Ser 
225 230 235 240 

Asp Gin Phe Ser Val Glu Phe Glu Val Glu Ser Leu Asp Ser Glu Asp 
245 250 255 



Tyr Ser Leu Ser Asp Glu Gly His 
260 

Val Tyr Arg Val Thr Val Tyr Gin 
275 280 



Glu Leu Ser Asp Glu Asp Asp Glu 
265 270 

Thr Gly Glu Ser Asp Thr Asp Ser 

285 



Phe Glu Gly Asp Pro Glu lie Ser Leu Ala Asp Tyr Trp Lys Cys Thr 
290 295 300 

Ser Cys Asn Glu Met Asn Pro Pro Leu Pro Ser His Cys Lys Arg Cys 
305 310 315 320 

Trp Thr Leu Arg Glu Asn Trp Leu Pro Asp Asp Lys Gly Lys Asp Lys 
325 330 335 

Val Glu lie Ser Glu Lys Ala Lys Leu Glu Asn Ser Ala Gin Ala Glu 
340 345 350 

Glu Gly Leu Asp Val Pro Asp Gly Lys Lys Leu Thr Glu Asn Asp Ala 
355 360 365 



Lys Glu Pro Cys Ala Glu Glu Asp Ser Glu Glu Lys Ala Glu Gin Thr 
370 375 380 
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Pro Leu Ser Gin Glu Ser Asp Asp 
385 390 

Ser He Val Tyr Ser Ser Gin Glu 
405 

Thr Gin His Lys Asp Glu Ser Val 
420 

lie Glu Pro Cys Val lie Cys Gin 
435 440 

Val His Gly Lys Thr Gly His Leu 
450 455 

Lys Leu Lys Lys Arg Asn Lys Pro 
465 470 

Gin Met lie Val Leu Ser Tyr Phe 
485 



Tyr Ser Gin Pro Ser Thr Ser Ser 
395 400 

Ser Val Lys Glu Leu Lys Glu Glu 
410 415 

Glu Ser Ser Phe Ser Leu Asn Ala 
425 430 

Gly Arg Pro Lys Asn Gly Cys lie 
445 

Met Ser Cys Phe Thr Cys Ala Lys 
460 

Cys Pro Val Cys Arg Gin Pro lie 
475 480 

Asn 



